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Abstract 



Let G be a plane graph of n nodes, m edges, / faces, and no self-loop. G need 
not be connected or simple (i.e., free of multiple edges). We give three sets of coding 
schemes for G which all take 0{m + n) time for encoding and decoding. Our schemes 
employ new properties of canonical orderings for planar graphs and new techniques of 
^ , processing strings of multiple types of parentheses. 

J> I For applications that need to determine in 0(1) time the adjacency of two nodes 

and the degree of a node, we use 2m + (5 + ^)n + o{m + n) bits for any constant 
, A; > while the best previous bound by Munro and Raman is 2m + 8n + o{m + n). If 

CN| I G is triconnected or triangulated, our bit count decreases to 2m + 3n + o{m + n) or 

2m + 2n + o(m + n), respectively. If G is simple, our bit count is |m + (5 + ^)n + o(n) 
for any constant A; > 0. Thus, if a simple G is also triconnected or triangulated, then 
^ I 2m + 2n + o{n) or 2m + n + o(n) bits suffice, respectively. 

J-^ ' If only adjacency queries are supported, the bit counts for a general G and a simple 

G become 2m + + o{m + n) and |m + 5n + o(n), respectively. 
^ I If we only need to reconstruct G from its code, a simple and triconnected G uses 

I ^ '°|^ ^ m + 0(1) ~ 2.38m + 0(1) bits while the best previous bound by He, Kao, and 

' Lu is 2.84m. 
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1 Introduction 



This paper investigates the problem of encoding a given graph G into a binary string S with 
the requirement that S can be decoded to reconstruct G. This problem has been extensively 
studied with three objectives: (1) minimizing the length of S", (2) minimizing the time needed 
to compute and decode S, and (3) supporting queries efficiently. 

As these objectives are often in conflict, a number of coding schemes with different trade- 
offs have been proposed. The standard adjacency-list encoding of a graph is widely useful but 
requires 2m[log?7,][] bits where m and n are the numbers of edges and nodes, respectively. 
A folklore scheme uses 2n bits to encode a rooted n-node tree into a string of n pairs of 
balanced parentheses. Since the total number of such trees is at least 2(n-i) " (n-i^^n-i)! ' 
minimum number of bits needed to differentiate these trees is the log of this quantity, which 
is 2n — o{n) by Stirling's approximation formula. Thus, two bits per edge up to an additive 
o(l) term is an information-theoretic tight bound for encoding rooted trees. The rooted trees 
are the only nontrivial graph family with a known polynomial-time coding scheme whose 
code length matches the information-theoretic bound. 

For certain graph families, Kannan, Naor and Rudich |12| gave schemes that encode 
each node with O(logn) bits and support 0(logn)-time testing of adjacency between two 
nodes. For dense graphs and complement graphs, Kao, Occhiogrosso, and Teng [|l7l devised 
two compressed representations from adjacency lists to speed up basic graph techniques. 
Galperin and Wigderson and Papadimitriou and Yannakakis investigated complexity 
issues arising from encoding a graph by a small circuit that computes its adjacency matrix. 
For labeled planar graphs, Itai and Rodeh gave an encoding of |n log n + 0(n) bits. For 
unlabeled general graphs, Naor gave an encoding of — nlogn + 0{n) bits. 

Let G be a plane graph with n nodes, m edges, / faces, and no self-loop. G need not 
be connected or simple (i.e., free of multiple edges). We give coding schemes for G which 
all take 0{m + n) time for encoding and decoding. The bit counts of our schemes depend 
on the level of required query support and the structure of the encoded family of graphs. In 
particular, whether multiple edges (or self-loops) are permitted plays a signiflcant role. 

For applications that require support of certain queries, Jacobson [Tl|] gave an 6(n)-bit 
encoding for a connected and simple planar graph G that supports traversal in B(logn) time 
per node visited. Munro and Raman pO[ recently improved this result and gave schemes to 
encode binary trees, rooted ordered trees and planar graphs. For a general planar G, they 
used 2m + 8n + o(m + n) bits while supporting adjacency and degree queries in 0(1) time. 
We reduce this bit count to 2m + (5 + + o(m + n) for any constant A; > with the 
same query support. If G is triconnected or triangulated, our bit count further decreases to 
2m + 3n + o(m + n) or 2m + 2n + o{m + n), respectively. With the same query support, we 
can encode a simple G using only |m + {5 + + o{n) bits for any constant k > 0. As a 
corollary, if a simple G is also triconnected or triangulated, the bit count is 2m + 2n + o{n) 
or 2m + n + o(?T,), respectively. 

If only 0(l)-time adjacency queries are supported, our bit counts for a general G and a 
simple G become 2m + yn + o(m + n) and |m + bn + o{n), respectively. All our schemes 
mentioned so far as well as that of |2^ can be modifled to accommodate self-loops with n 



^AU logarithms are to base 2. 
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Figure 1: This table compares our results with previous ones, where n is the number of 
nodes, m is the number of edges, and A; is a positive constant. The lower-order terms are 
omitted. All but row 1 assume that G has no self-loop. 



additional bits. 

If we only need to reconstruct G with no query support, the code length can be substan- 
tially shortened. For this case, Turan used 4m bits for G that may have self-loops; this 
bound was improved by Keeler and Westbrook |jl8[ to 3.58m bits. They also gave coding 
schemes for several important families of planar graphs. In particular, they used 1.53m bits 
for a triangulated simple G, and 3m bits for a connected G free of self-loops and degree-one 
nodes. For a simple triangulated G, He, Kao, and Lu improved the count to |m + 0(1). 
Tutte gave an information-theoretic tight bound of roughly 1.08m bits for a triangulated 
G. For a simple G that is free of self-loops, triconnected and thus free of degree-one nodes. 
He, Kao, and Lu improved the count to at most 2.84m. We further improve the bit count 
to at most ^^^m -1-0(1). Figure |l] summarizes our results and compares them with previous 
ones. 

Our coding schemes employ two new tools. One is new techniques of processing strings 
of multiple types of parentheses. This generalizes the results on strings of a single type 
of parentheses in ||20|. The other tool is new properties of canonical orderings for plane 



graphs. Such orderings were introduced by de Fraysseix, Pach and Pollack and extended 
by Kant |[13|. These structures and closely related ones have proven useful also for drawing 
plane graphs in organized and compact manners |T3, IB, Z^, 21 . 

Section ^ discusses the new tools. Section ^ describes the coding schemes that support 
queries. Section | presents the more compact coding schemes which do not support queries. 
The methods used in ^ and §|| are independent, and these two sections can be read in the 
reverse order. 

Remark. Throughout this paper, for all our coding schemes, it is straightforward to verify 
that both encoding and decoding take linear time in the size of the input graph. Hence for 
the sake of conciseness, the corresponding theorems do not state this time complexity. 
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2 New Encoding Tools 



2.1 Basics 

A simple (respectively, multiple) graph is one that does not contain (respectively, may con- 
tain) multiple edges between two distinct nodes. The simple version of a multiple graph is 
one obtained from the multiple graph by deleting all but one copy of each edge. 

In this paper, all graphs are multiple and unlabeled unless explicitly stated otherwise. 
Furthermore, for technical simplicity, a multiple graph is formally a simple one with positive 
integral edge weights, where each edge's weight indicates its multiplicity. 

The degree of a node w in a graph is the number of edges, counting multiple edges, incident 
to V in the graph. A node w is a leaf of a tree T if w has exactly one neighbor in T. Since T 
may have multiple edges, a leaf of T may have a degree greater than one. We say v is internal 
in T if f has more than one neighbor in T. See for other graph-theoretic terminology 
used in this paper. 

For a given problem of size n, this paper uses the log n-bit word model of computation in 



TT|,[T9|,^, where operations such as read, write, add and multiply on O(logn) consecutive 
bits take 0(1) time. The model can be implemented using the following techniques. If a 
given chunk of O(logn) consecutive bits do not fit into a single word, they can be read or 
written by 0(1) accesses of consecutive words. Basic operations can be implemented by 
table look-up methods similar to the Four Russian algorithm M. 



2.2 Multiple Types of Parentheses 

A string is binary if it contains at most two kinds of symbols; e.g., a string of one type of 
parentheses is a binary string. 

Fact 1 (see [@,|^) Let k = 0(1). Given any strings Si, S2, ■ ■ ■ , Sk with total length 0{n), 
there exists an auxiliary binary string A such that 

• the string A has O(logn) bits and can be computed in 0{n) time; 

• given the concatenation of X, Si, S2, ■ ■ ■ , Sk as input, the index of the first symbol of 
any given Si in the concatenation can be computed in 0(1) time. 

Let Si + S2 + ■ ■ ■ + Sk denote the concatenation of A, 6*1, 6*2, ... , Sk as in Fact |1]. 

Let S* be a string. Let 15*1 be the length of S. Let S[i] be the symbol at the i-th position 
of S. S[k] is enclosedhy S[i] and S[j] in S if i < k < j. Let select(S', z, □) be the position 
of the i-th □ in 5*. Let rank(S', k, □) be the number of D's before or at the k-th position of 
S. Clearly, if k = select(5', i, □), then i = rank(S', k, □). 

Now let S" be a string of multiple types of parentheses. For an open parenthesis S[i] and 
a close one S[j] of the same type where i < j, the two match in S if every parenthesis of the 
same type that is enclosed by them matches one enclosed by them. S is balanced if every 
parenthesis in S belongs to a matching parenthesis pair. 

Here are some queries defined for S: 
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• Let match(S', i) be the position of the parenthesis in S that matches S[i]. 

• Let firstfc(S', i) (respectively, \a.stk{S,i)) be the position of the first (respectively, last) 
parenthesis of the k-th type after (respectively, before) S[i]. 

• Let enclosefc(S', ii, 22) be the positions {ji, 32) of the closest matching parenthesis pair 
of the fc-th type that encloses S[ii\ and S[i2\. 

The answer to a query may be undefined; e.g., match(S', i) is undefined for some i if S" is 
not balanced. If there is only one type of parentheses in S", the subscript k in firstfc(S', -i), 
lastfc(S',2), and enclosefc(S', i, j) may be omitted; thus, first(S',2) = i + l and last(S','i) = i — 
If it is clear from the context, the parameter S may also be omitted. 



Fact 2 (see [|,[T|,|2g]) 



1. Let S be a binary string. An auxiliary binary string fJ^iiS) of length 0(15*1) is obtainable 
in 0(15*1) time such thatTa.nk{S,i, □) anc? select (5, z, □) can be answered from S + fii{S) 
in 0(1) time. 

2. Let S be a balanced string of one type of parentheses. An auxiliary binary string /i2(5) 
of length o{\S\) is obtainable in 0(|5|) time such that match(5, z) and enclose (5, i,j) 
can be answered from 5* + fJ'2{S) in 0(1) time. 

The next theorem generalizes Fact 0. 

Theorem 2.1 Let S be a string of 0(1) types of parentheses that may be unbalanced. 
An auxiliary o{\S\)-bit string a{S) is obtainable in 0(|5*|) time such that rank(5*, i, □), 
select (5*, i, □), firstfc(5*, z), lastfc(5*, i), match(5*, i), and eiac\osek{S,i, j) can be answered from 
5* + a(5*) m 0(1) time. 

Proof. The case of rank(5*, i, □) and select(5*, i, □) is a straightforward generalization of 
Fact KIID. The case of firstfc(5, i) is proved as follows. Let /(5, z, □) be the position of the 
first □ after S[i]. Then, 

f{S,i,a) = select(5,l + rank(5, «,□),□); 
firstfc(5,i) = min{/(5,ii, (),/(5,z, ))}, 

where ( and ) are the open and close parentheses of the k-th type in 5, respectively. The 
case of lastfc(5*, i) can be shown similarly. 

To prove the case of match(5*, i) and enclose fc( 5*, i,,;'), we first generalize Fact 0(|]) for an 
unbalanced binary 5*. Let R be the shortest balanced superstring of 5*. Let d = \R\ — |5*|. 
R is either 5* appended by d close parentheses or d open parentheses appended by 5*. Let 
/3(5*) be fi2{R) appended to 1 + [log(n + 1)] bits which record d and whether 5* is a prefix 
or a suffix of R. Then, a query for 5* can be answered from 5* + /9(5*) in 0(1) time. 

Now suppose that 5* is of £ types of parentheses. Let 5*^ with 1 < k < i he the string 
obtained from 5* as follows. 
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• Every open (respectively, close) parenthesis of the k-th type is replaced by two consec- 
utive open (respectively, close) parentheses of the k-th type. 

• Every parenthesis of any other type is replaced by a matching parenthesis pair of the 
k-th type. 

Each Sk is a string of length 2|5'| using one type of parentheses. Each symbol Sk[i] can be 
determined from 5'[[i/2j] in 0(1) time. For example, 

S = [[({)]({}}(]) 
Si = ()()((()))()((()()()((())) 

^2 = [[[[□[][]]][][][][][]]][] 

^3 = 000{{00{}{{}}}}{}{}{} 
A query for S can be answered by answering the queries for some Sk as follows. 

• match(5', i) = [match(S'fc, 2i)/2j , where S[i] is a parenthesis of the k-th type. 

• Let i and j be two positions. Let 

A = {2i,2i + l,match(5'jfc,2i),match(5'fe,2i + l)}U 
{2j, 2j + 1, match(>Sfc, 2i), match (>Sfc, 2j + 1)}. 

Let ii = mm A, ji = max A, and (^2,^2) = enclose (5*^, ii,ji). Then, enclosek{S,i, j) = 
(L.2/2J,Lj2/2J). 

Each query above on Sk can be answered in 0(1) time from Sk + f3{Sk). Since each symbol 
Sk[i] can be determined from >S'[|_i/2j] in 0(1) time, the theorem holds by letting a{S) = 
P{S,)+f3{S2) + --- + P{Se). □ 

Given k strings Si, . . . ,Sk of 0(1) types of parentheses, let a{Si, 5*2, ... , Sk) denote 
a{Si)+a{S2) H ha{Sk). 

2.3 Encoding Trees 

An encoding for a plane graph G is weakly convenient if it takes linear time to reconstruct G; 
0(1) time to determine the adjacency of two nodes in G; 0{d) time to determine the degree 
(i of a node; and 0{d) time to list the neighbors of a node of degree d. A weakly convenient 
encoding for G is convenient if it takes 0(1) time to determine the degree of a node. 

For a simple rooted tree T, the folklore encoding F{T) is defined as follows. Initially, 
F(T) is a balanced string of one type of parentheses representing the preordering of T. An 
open (respectively, close) parenthesis denotes a descending (respectively, ascending) edge 
traversal. Then, this string is enclosed by an additional matching parenthesis pair. Note 
that each node of T corresponds to a matching parenthesis pair in F(T). 

Fact 3 Let Vi be the i-th node in the preordering of a simple rooted tree T. 

1. The parenthesis pair for Vi encloses that for Vj in F{T) if and only if Vi is an ancestor 

ofVj. 
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2. The parenthesis pair for Vi precedes that for Vj in F{T) if and only if Vi and Vj are not 
related and i < j. 

3. The i-th open parenthesis in F{T) belongs to the parenthesis pair for vi. 



Fact 4 (see [|20|| ) For a simple rooted tree T of n nodes, F{T) + fii{F{T)) + fi2{F{T)) is a 



weakly convenient encoding of 2n + o{n) bits. 



Based on Theorem |2.1| , we show that Fact |^ holds even if F(T) is interleaved with other 
types of parentheses. 

Theorem 2.2 Let T be a simple rooted tree. Let S be a string of 0{1) types of parentheses 
such that a given type of parentheses in S gives F{T). Then S + a{S) is a weakly convenient 
encoding ofT. 

Proof. Let the parentheses, denoted by ( and ) , in S" used by F(T) be the k-th type. 
Let Vi, . . . ,Vn be the preordering of T. Let pi = select(5', i, () and = match(S',pj); i.e., 
S[pi] and S[qi] are the matching parenthesis pair corresponding to Vi by Fact By 
Theorem |2.1| , each pi and qi are obtainable from S' + a(S') in 0(1) time. Moreover, the index 
i is obtainable from pi or in 0(1) time hj i = rank(5',pi, () = rank(S', match (S*, gj), (). 
The queries for T are supported as follows. 

Case 1: adjacency queries. Suppose i < j. Then, {pi,qi) = enclosefc(pj, g^) if and only if 
Vi is adjacent to Vj in T, i.e., Vi is the parent of Vj in T. 

Case 2: neighbor queries. Suppose that Vi has degree d in T. The neighbors of f « in T can 
be listed in 0{d) time as follows. First, if i 7^ 1, output Vj, where {pj,qj) = enclose fc (pj, gi). 
Then, let pj = firstfc(pj). As long as pj < qi, we repeatedly output Vj and update pj by 
firstfc (match {pj ) ) . 

Case 3: degree queries. Since T is simple, the degree d of Vi in T is the number of 
neighbors in T, which is obtainable in 0{d) time. □ 

The next theorem improves Theorem |2.2| and is important for our later coding schemes. 
A related result in shows that a fc-page graph of n nodes and m edges has a convenient 
encoding of 2m + 2kn + o(m + n) bits. Since T is a one-page graph, this result gives a longer 
convenient encoding for T than the next theorem. 

For a condition P, let S{P) = 1, if P holds; S{P) = 0, otherwise. 

Theorem 2.3 Let T be a rooted tree of n nodes, n* leaves and m edges. Let S + a{S) be a 
weakly convenient encoding of the simple version Tg ofT. 

1. A string D of2m — n + n* bits is obtainable in 0{m + n) time such that S + D + a{S,D) 
is a convenient encoding for T. 

2. If T = Tg, a string D of n* bits and a string Y of n bits are obtainable in 0{m + n) 
time such that S + D + a{S, D, Y) is a convenient encoding for T. 



7 



Remark. In Statement the convenient encoding contains Q:{Y) but not Y itself, which is 
only used in the decoding process and is not explicitly stored. This technique is also used in 
our other schemes. Proof. Let Vi, . . . ,Vn be the preordering of T,. Let di be the degree of 
Vi in T. We show how to use D to store the information required to obtain di in 0(1) time. 

Statement 1. Let 6i = 6{vi is internal in T^). Since S + a{S) is a weakly convenient 
encoding for Tg, each 6i is obtainable in 0(1) time from 5* + a{S). Initially, D is n copies of 
1. Let bi = di — 1 — 6i. We add bi copies of right after the i-th 1 in D for each f j. Since the 
number of internal nodes in Tg is n—n*, the bit count oi D is n+J2^=i{di — ^—Si) = 2m—n+n*. 
D is obtainable from T in 0(m + ri) time. The number bi of O's right after the i-th 1 in D 
is select(D, i + 1,1) — select(D, i, 1) — 1. Since di = 1 + 6i + bi, the degree of Vi in T can be 
computed in 0(1) time from S + D + a{S, D). 

Statement 2. Let n2 be the number of nodes of degree two in T. Initially, D is n — n* —n2 
copies of 1, one for each node of degree at least three in T. Suppose that Vi is the /ij-th node in 
vi, . . . ,Vn of degree at least three. We put di—3 copies of right after the hi-th 1 in D. The bit 
count of D is {n-n*-n2)+T,i,di>3{di-3) = {n-n*-n2) + {J27=i di-n* -2n2)-3{n-n* -n2) = 
n* - 2 < n*. 

Since 5* + a{S) is a weakly convenient encoding for T, it takes 0(1) time to determine 
whether di> 3 from S + a{S). If di < 3, di can also be computed in 0(1) time from S + a{S). 
To compute di when di > 3, note that since di = 3 + select(D, hi + 1,1)— select(i5, hi, 1) — 1, 
it suffices to compute hi in 0(1) time. Let Y be an n-bit string such that Y[i] = 1 if and only 
if di > 3. Then, hi = rank(y, 1), obtainable in 0(1) time from Y + a(Y). Each symbol 
Y[i] can be determined from S + a{S) in 0(1) time, and we do not need to store Y in our 
encoding. □ 



2.4 Canonical Orderings 



This section reviews canonical orderings of plane graphs P,|T3l and proves new properties 
needed in our coding schemes. 

All graphs in this section are simple. Let G be a plane graph. Let vi,V2, . . . ,Vn be an 
ordering of the nodes of G. Let Gi be the subgraph of G induced hj vi,V2, ■ ■ ■ ,Vi. Let Hi be 
the boundary of the exterior face of Gi. This ordering is canonical if the interval [3, n] can be 
partitioned into Ii, ... ,1k with the following properties for each Ij. Suppose Ij = [k,k + q]. 
Let Cj be the path Vk, Vk+i, ■ ■ ■ , Vk+q- 

• Gk+q is biconnected. Hk+q contains the edge (ui,f2) and Cj. Cj has no chord in G. 

Remark. Since Hk+q is a cycle, to enhance visual intuitions, we draw its nodes in the 
clockwise order from left to right above the edge {vi, V2). 

• If g = 0, ffc has at least two neighbors in Gk-i, all on Hk-i. If g > 0, Cj has exactly 
two neighbors in Gk-i, both on Hk^i, where the left neighbor is incident to Cj only at 
Vk and the right neighbor only at Vk+q- 

Remark. Whether g = or not, let and Vr denote the leftmost neighbor and the 
rightmost neighbor of Cj on Hk-i- 



For each Vi where k < i < k + q, if i < n, Vi has at least one neighbor in G — G 



k+q- 
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Figure 2: A triconnected plane graph G and a canonical ordering of G. 




Figure 3: (1) a canonical ordering of a plane triangulation G; (2) a realizer of G. 

Figure ^ shows a canonical ordering of a triconnected plane graph; Figure |^(1) illustrates 
one for a plane triangulation. 

Fact 5 (see [|,[T^) 

1. If G is triconnected or triangulated, then it has a canonical ordering that can be con- 
structed in linear time. 

2. For every canonical ordering of a triangulated G, 

• each Ij consists of exactly one node, i.e., q = 0; 

• the neighbors ofv^ in G^-i form a subinterval of the path H^^i — {{vi, V2)}, where 
H2 — {{vi,V2)} is regarded as the edge (fi,f2) itself. 

Given a canonical ordering of G with its unique partition Ii, I2, . . . , Ir, G = Gn is 
obtainable from G2 = {{vi,V2)} in K steps, one for each Ij. Step j obtains Gk+q from Gk-i 
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by adding the path Vk,Vk+i, ■ ■ ■ ,Vk+q and its incident edges to Gk-i- This process is called 
the construction algorithm for G corresponding to the ordering. 

For the given ordering, the canonical spanning tree T of G rooted at Vi is the one formed 
by the edge (^1,^2) together with the paths Cj and the edges {v£, Vk) over all Ij. In Figures || 
and 0(1), T is indicated by thick lines. 

Lemma 2.4 



1. For every edge {vi,Vir) in G ~T, Vi and Vi' are not related in T. 

2. For each node Vi, the edges incident to Vi in G form the following pattern around Vi in 
the counterclockwise order: an edge to its parent in T; followed by a block of nontree 
edges to lower-numbered nodes; followed by a block of tree edges to its children in T; 
followed by a block of nontree edges to higher-numbered nodes, where a block may be 
empty. 

Proof. 

Statement 1. Suppose that is added at step j of the construction algorithm for 

G. Either Vi or Vi' is on the path Vk,Vk+i, ■ ■ ■ ,Vk+q and the other is to the right of ve on 
Hk-i- Hence Vi is neither an ancestor nor a descendant of Vii in T. 

Statement 2. Suppose that Vi is added at step j. The tree edge from Vi to its parent, 
i.e., f£ or and all the nontree edges between Vi and its lower- numbered neighbors are 
added during this step; if z < + g, no such nontree edge exists. All such nontree edges 
precede other edges incident to Vi in the counterclockwise order. Any edge e = with 
i < i' is added during step f with j < f . Let If = [k', k' + q']. Thus, e is a tree edge only if 
i' = k' and f j is the leftmost neighbor of f j' in Hi/_i for otherwise e would be a nontree edge. 
The tree edges between Vi and its children, which are higher numbered, precede the nontree 
edges between Vi and its higher-numbered neighbors in the counterclockwise order. □ 

Let T' be a tree embedded on the plane. Let {x, y) be an edge of T' . The counterclockwise 
preordering of T' starting at x and y is defined as follows. We perform a preorder traversal 
on T' starting at x and using (x, y) as the first visited edge. Once a node v is visited via 
the unvisited nodes adjacent to v are visited in the counterclockwise order around v 
starting from the first edge following {w,v). 

Fact 6 (see [p|, |14|| ) For every triconnected plane graph, the counterclockwise preordering 
of any canonical spanning tree is also a canonical ordering of the graph. 

Remark. The canonical ordering in Figure |] is the counterclockwise preordering of T . 

Assume that G is a triangulation with exterior nodes vi,V2,Vn in the counterclockwise 
order. A realizer of G is a partition of the interior edges of G into three trees Ti,T2,T„ 
rooted at fi,f2,fn respectively with the following properties [p4| : 



1. All the interior edges incident to vi (t>2 or respectively) belong to Ti (T2 or T„ 
respectively) and oriented to vi {v2 or f„, respectively). 
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2. For each interior node w, the edges incident to v form the following pattern around v 
in the counterclockwise order: an edge in Ti leaving v; followed by a block of edges in 
Tn entering v; an edge in T2 leaving v; followed by a block of edges in Ti entering v; 
an edge in T„ leaving v; followed by a block of edges in T2 entering v, where a block 
may be empty. 

Figure ^(2) illustrates a realizer of the plane triangulation of Figure ^(1). The next fact 
relates a canonical ordering and a realizer via counterclockwise tree preordering. 



Fact 7 (see [14, |24|| ) Let G be a plane triangulation. 



1. Let vi,V2, ■ ■ ■ ,Vn be a canonical ordering of G. Note that each Ij consists of a node 
Vk- Orient and partition the interior edges of G into three subsets Ti,T2,T„ as follows. 
For each Vk with k >3, {vk,V£) is in Ti oriented to v^; {vk,Vr) is in T2 oriented to Vri 
the edges {vk, f «) where i < i < r are in Tn oriented to v^- Then Ti, T2, is a realizer 
of G. Consequently, every plane triangulation has a realizer that can be constructed in 
linear time. 

2. For a realizer Ti, T2, T„ of G, let T = Ti U {(f 1, V2), (f 1, fn)}- Let vi, V2, ■ ■ ■ ,Vn be the 
counterclockwise preordering ofT that starts at vi and uses (fi,f2) as the first visited 
edge. Then vi,V2, ■ ■ ■ ,Vn is a canonical ordering of G, and T is a canonical spanning 
tree rooted at vi . 



In Figure 0(1), the tree T stated in Fact |^(|^) is indicated by thick lines, and the canonical 
ordering shown is the counterclockwise preordering of T. 



3 Schemes with Query Support 



This section presents our coding schemes that support queries. We give a weakly convenient 
encoding in 



3.1. 



This encoding illustrates the basic techniques applicable to our coding 
schemes with query support. We then give the schemes for triconnected, triangulated, and 
general plane graphs in §p]2|, §p.3|, and 



respectively. We show how to accommodate 



self-loops in 



3.1 Basic Techniques 

• Let Gs be a simple plane graph with n nodes and edges. 

• Let T be a spanning tree of Gg that satisfies Lemma |2.4| . Let n* be the number of 
leaves in T. Let vi, . . . ,Vn be the counterclockwise preordering of T. 

• Let Ga be a graph obtained from Gg by adding multiple edges between adjacent nodes 
in Gs — T. Let ma be the number of edges in Ga, counting multiple edges. 

We now give a weakly convenient encoding for Ga using parentheses to encode T and brackets 
to encode the edges in Ga — T. Initially, let 5* = F(T). Let d and )i be the parenthesis 
pair corresponding to Vi in S. We insert into S a pair and ]e for each edge e = {vi,Vj) of 
Ga — T with i < j as follows. 
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• Ee is placed right after )j, and 

• ]e is placed right after (j. 

For example, the string S for the graph in Figure ^ is: 

(()[[[(](])[[(])[[[)[(]])[(]])[(]]]])) 
122 34455 366778 81 

Note that if Vi is adjacent to ii lower-numbered nodes and hi higher- numbered nodes in 
Ga — T, then in S the open parenthesis d is immediately followed by £i close brackets, and 
the close parenthesis by /ij open brackets. 

Lemma 3.1 The last parenthesis that precedes an open {respectively, close) bracket in S is 
close {respectively, open). 

Proof. Straightforward. □ 

Let e = {vi,Vj) be an edge of — T with i < j. By Lemma Vi and Vj are not 

related. By Fact )i precedes (j in S. Also, [e precedes ]e in 5 for every edge e in 

Ga — T, counting multiple edges. Note that [g and ] e do not necessarily match each other 
in S. In the next lemma, let S[p] < S[q] denote that S[p] precedes S[q] in S, i.e., p < q. 

Lemma 3.2 Let e and f be two edges in Ga ~ T with no common endpoint. If < [/, 
then either [g < le < [f < ] / or < [/<]/< le- 

Proof. Suppose e = {vi,Vj) and / = {vk,Vh), where i < j and k < h. Assume for a 
contradiction that [e< [/<]e<]/- Since e and / have no common endpoint, )« < )fc < 
(j < (h- There are four possible cases: 

1. (fc < < )i < )fc < (j < (h<)h< )j; see Figure 1(1). 

2. (i < < (fc < )fc < (j < (h<)h< )j; see Figure 1(2). 

3. (k < (t <h <)k < (j < )j < (h<)h; see Figure |(3). 

4. (i <)^ < (k <)k < (j < )j < (h<)h] see Figure |(4). 

In Figure ^ the dark lines are paths in T and the dashed ones are edges in Ga — T. The 
relation among these lines follows from Fact ^ and Lemma p.4| (p|). In all the cases, e crosses 
/, contradicting the fact that Ga is a plane graph. □ 

By Lemma ] g and the bracket that matches [g in S are in the same block of brackets. 
From here onwards, we rename the close brackets by redefining Ig to be the close bracket 
that matches [g in S. Note that Lemma p.l| still holds for 5*. 

Lemma 3.3 S + a{S) is a weakly convenient encoding for Ga- 
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(3) 



(4) 



Figure 4: Edge crossing 

Proof. Since T is simple, by Theorem S + a{S) is a weakly convenient encoding for T. 
We next show that S + a{S) is also a weakly convenient encoding for Ga — T. Let pi and 
be the positions of d and )« in 5, respectively. 

Case 1: adjacency queries. Suppose i < j. Then, Vi and Vj are adjacent in Ga — T if and 
only if gj < p < g < firsti(pj), where (p, q) = enclose2(firsti(gj),pj) as shown below. 

h [ ^ g ] ^ 

gi p firsti(gj) pj q firsti(pj) 

Case 2: neighbor and degree queries. The neighbors and thus the degree of a degree-ci 
node Vi in Ga — T are obtainable in 0{d) time as follows. 

For each position p such that qi < p < firsti(gi), we output vj, where pj = lasti(match(p)) 
as shown below. Note that (fj, vj) is an edge in — T with j > i. 

). [ (j ] 

qi p firsti(gj) pj match(p) 

For each position g such that pi < q < firsti(pj), we output vj, where qj = lasti(match(g)) 
as shown below. Note that {vi, vj) is an edge in — T with j < i. 

)j [ (. ] 

qj match(g) pi q firsti(pj) 

□ 

Since 151 = 2n + 2(ma — n + 1) = 2ma + 2 and S uses four symbols, 5* can be encoded 
by + 4 bits. The next lemma improves this bit count. 
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Lemma 3.4 Let S' be a string of si parentheses and S2 brackets that satisfies Lemma 3A_ 
Then S' can be encoded by a string o/2si + S2 + o{si + S2) bits, from which each S'[i\ can be 
determined in 0(1) time. 

Proof. Let S[ and S2 be two binary strings defined as follows, both obtainable in 0(|S"|) 
time: 

• = 1 if and only if S'[i] is a parenthesis for 1 < z < si + S2; 

• 5*2 [j] = 1 if and only if the j-th parenthesis in S' is open for 1 < j < si. 

Each S'[i\ can be determined from S[ + S'2 + Oi{S[) in 0(1) time as follows. Let j = 
rank(S'(, i, 1). If S[ [i] = 1, S'[i] is a parenthesis. Whether it is open or close can be determined 
from S'gfj]. If S[[i] = 0, S'[i] is a bracket. Whether it is open or close can be determined 



from S'2 [select ( 5^, rank (5^, z, 1), 1)] by Lemma ^]T]. □ 



The next lemma summarizes the above discussion. 

Lemma 3.5 Ga has a weakly convenient encoding of 2ma + 2n + o{ma + n) bits, from which 
the degree of a node in Ga — T is obtainable in 0(1) time. 



Proof. This lemma follows from Lemmas |3.1| , |3.3| , and and the fact that S contains 2n 
parentheses and 2{ma — n + 1) brackets. □ 

3.2 Triconnected Plane Graphs 

This section adopts all the notation of §3.1| with the following further definitions. 

• Let G be a triconnected plane graph. 

• Let Gs be the simple version of G. 



Let T be a canonical spanning tree of Gs , which therefore satisfies Lemma . 



Note that G is obtained from Ga be adding multiple edges between adjacent nodes in T. We 
next show that the weakly convenient encoding for Ga in Lemma |3.5| can be shortened to 
2{ma + n — n*) + o(n) bits. We also give a convenient encoding for Ga of 2ma + 2n + o(n) 
bits. Then we augment both encodings to accommodate multiple edges in T. This gives 
encodings of G. 

Let Vh be a leaf of T with 2 < h < n. By the definitions of T and a canonical ordering, Vh 
is adjacent to at least one higher- numbered node and at least two distinct lower- numbered 
nodes in Ga. By the definition of T, the parent of Vh in T, i.e., the only neighbor of Vh 
in T, has a lower number than Vh. Thus, Vh is adjacent to a higher- numbered node and a 
lower-numbered one in Ga — T. Thus, ih is immediately succeeded by a ] , and )/i by a [. 
With these observations, we can remove a pair of brackets for every Vh from S without losing 
any information on Ga as follows. Let P be the string obtained from S by removing the ] 
that immediately succeeds (h as well as the [ that immediately succeeds for every Vh- 
Let Q be the string obtained from S by removing (h and )/i for every Vh- 
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Note that \P\ = \Q\. Also, Q[i] is obtainable from P + a{P) in 0(1) time as follows: 

if P[{\ = (, P[firsti(P,z)] = ), and 2 < rank(P,i, () < n; 
Q[i] = ■{ I if P[{\ = ) , P[lasti(P, i)] = (, and 2 < rank(P, i, Q < n; 
otherwise. 

Lemma 3.6 P + Q + a{P, Q) is a weakly convenient encoding for Ga, and a convenient 
encoding for Ga — T. 




Proof. Note that the parentheses in P form F{T). Thus, by Theorem |2.2| , it suffices to 
show that P + Q + a{P, Q) is a convenient encoding for — T as follows. 

Case 1: adjacency queries. Given i < j, let {p,q) = enclose2(Q, firsti(P, — 1); the 

— 1 in the last parameter accounts for the possibility that Q\pj] is a bracket. Note that Vi 
is adjacent to Vj if and only if qi < p < q < firsti(P, pj) as shown below. Here, the first 
inequahty accounts for the possibility of Q[qi] being a bracket. 

P ). (, 
Q [ ] 

qi p firsti(P, gi) pj q firsti(P,pj) 

Case 2: neighbor queries. The neighbors of Vi can be listed as follows. 

For every position p with q^ — S{Q[qi] = [) < p < firsti(P, g^), we output Vj, where 
Pj = lasti(P, match(Q,p) + 1) as shown below. Note that the +1 in the last parameter 
accounts for the possibility of P [match being a parenthesis. Also, {vi,Vj) is an edge 
in Ga — T with j > i. 

P h (i 

Q [ ] 

qi p firsti(P, gj) pj ma.tch{Q,p) 

For every position g with pi — S{Q[pi] = ]) < g < firsti(P,pj), we output Vj, where 
qj = lasti(P, match(Q, g) + 1) as shown below. Note that the +1 in the last parameter 
accounts for the possibility of P[match((5, g)] being a parenthesis. Note that {vi,Vj) is an 
edge in — T with j < i. 

P ), u 
Q [ ] 

qj match(Q,g) pi q firsti(P,Pi) 

Case 3: degree queries. The degree of Vi in Ga — T is firsti(P, g^) — gi + S{Q[qi] = 
[) + firsti(P,pi) -pi + 6{Q[pi] = ] ) - 2, obtainable from P + Q + a{P, Q) in 0(1) time. □ 

Lemma 3.7 Ga has a weakly convenient encoding of 2ma + 2n — 2n* + o{ma + n) bits, 
from which the degree of a node in Ga — T is obtainable in 0(1) time. Moreover, Ga has a 
convenient encoding of 2ma + 2n — n* + o{ma + n) bits. 
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Proof. Since each Q[i] is obtainable from P + a{P) in 0(1) time, by Lemma 3^, P + a{P, Q) 



is also a weakly convenient encoding for Ga- Since S satisfies Lemma ^]T] and P is obtained 
from S by removing some brackets, P also satisfies Lemma p.l| . Since P has 2n parentheses 
and 2{ma — {n — 1) — n*) brackets, by Lemma |3.4| , Ga has a weakly convenient encoding of 
2[ma+n—n*)+o{ma+n) bits. To augment this weakly convenient encoding into a convenient 
one, note that the degree of Vi in Ga — T is obtainable in 0(1) time from P + Q + a{P, Q). 
By Theorem |2.3| (^, n* + o{n) additional bits suffice for supporting a degree query for T in 
0(1) time. Thus, Ga has a convenient encoding of 2ma + 2n — n* + o{ma + n) bits. □ 

The next theorem summarizes the above discussion and extends Lemma |3.7| to accom- 
modate multiple edges in T. 

Theorem 3.8 Let G he a triconnected plane graph of n nodes and m edges. Let Gg be the 
simple version of G with nig edges. Let n* he the number of leaves in a canonical spanning 
treeT ofGg. ThenG {respectively, Gs) has a convenient encoding of2m + 3n — n* + o{m + n) 
{respectively, 2ms + 2n — n* + o{n)) hits. 



Proof. The statement for Gs follows immediately from Lemma p.7| with Ga 

To prove the statement for G, let Ga be the graph obtained from Gs by adding the 
multiple edges of G between adjacent nodes in Gs — T . By Lemma if Ga has edges, 
then Ga has a weakly convenient encoding of 2{ma + n — n*) + o{ma + n) bits, from which a 
degree query for Ga — T takes 0(1) time. Next, let = G — {Ga — T). To support degree 
queries for T^, note that T^, is a multiple tree of n nodes and m — ma + n — 1 edges. By 
Theorem |2.3| (|ID, 2{m — ma + n — l)—n + n* + o{m) additional bits suffice for supporting a 
degree query of in 0(1) time. Thus, G has a convenient encoding of 2m + 3n — n* + o(m) 
bits. □ 



3.3 Plane Triangulations 



Since every plane triangulation is triconnected, all the coding schemes of Theorem 



are 



applicable to plane triangulations. The next theorem shortens their encodings. The theorem 
and its proof adopt the notation of ^3.1| and § |3.2| . 



Theorem 3.9 Assume that G is a plane triangulation of n nodes and m edges. Let Gg he 
the simple version of G with ms = 3n — 6 edges. Then G {respectively, Gs) has a convenient 
encoding of 2m + 2n + o{m + n) {respectively, 2ms + n + o{n)) bits. 

Proof. By the definition of a canonical ordering, every Vi with 1 < i < n is adjacent to 
a higher-numbered and a lower-numbered node in Gs — T . Thus when computing the P of 



3.2 from 5, we can also remove the [ right after )i even if Vi is internal in T. Then, the 



string Q of length |P| is redefined as follows: 

if P\i] = (, P[firsti(P,i)] = ), and 2 < rank(P,i, () < n; 
Q[i] = { I if P[i] = ) and 1 < rank(P, i, Q < n; 
otherwise. 




The proof of Lemma |3.6| works identically. Since the count of brackets decreases by n — n* + 
0(1), each encoding in Theorem |3.8| has n — n* + 0(1) fewer bits. □ 
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3.4 General Plane Graphs 

This section assumes that if a plane graph has more than one connected component, then 
no connected component is inside an interior face of another connected component. 

Let Gs be a simple plane graph with n nodes, rhs edges, and c connected components 
Ml, M25 • • • 5 Let hj and rhj be the numbers of nodes and edges in Mj. 

For each M^-, we define a graph Mj as follows. If fij < 3, let Mj = Mj. If fij > 3, let Mj 
be a graph obtained by triangulating Mj. Among the 3hj — 6 edges in Mj, the ones in Mj 
are called real, and the others are unreal. 

For each Mj, we define a spanning tree Tj as follows. If < 3, let Tj be an arbitrary 
rooted spanning tree of Mj. For fij > 3, recall that by Fact |3(|l|), Mj has a realizer formed 
by three edge-disjoint trees. Furthermore, three canonical spanning trees Tj,Tj,Tj of Mj 
are obtainable by adding to each of these three trees two boundary edges of the exterior face 
of Mj. Let Tj be a tree among Tj, Tj, Tj with the least number of unreal edges. 

Let T be the tree rooted at a new node fo by joining the root of each Tj to Vq with an 
unreal edge; note that T is obtainable in 0{n) time by Fact |^. Let rriu be the number of the 
unreal edges of T; thus, T has n — ruu real edges. Let fo, f 1, f2, . . . , fn be a counterclockwise 
preordering of T. Let di be the degree of Vi in T. Let be the number of nodes of degree 
more than k in T. 

Let Gs be the simple graph composed of the edges in Gs and the unreal edges in T. A 
node of Gg is real if its incidental edge to its parent in T is real; note that each child of Vq 
in T is unreal. 

Let Ea be a set of ia multiple edges between adjacent nodes in Gg — T. Let Ga = GsUEa 
and Ga = GsU Ea- Let and be the numbers of edges in G„ and Ga, respectively; i.e., 

ma = ms + ia and rha = fhs + L- 

Lemma 3.10 

1. rUu <n~ ^fhs. 

2. rUu — c < |n. 

3. Nk < f . 
Proof. 

Statement 0. Let Uj be the number of unreal edges of Tj. Clearly m„ = c+Ui+U2+- ■ ■+Uc. 
Since rhs = f^i + + ■ ■ • + fhc, it suffices to prove the claim that Uj < hj — ^rhj — 1 for every 
j = 1,2, ... ,c. For hj < 2, the claim holds trivially. Now suppose hj > 3. For t = 1,2,3, 
let rt and Ut be the numbers of real and unreal edges in Tj, respectively. Since the three 
trees in a realizer of Mj are edge disjoint, ri + r2 + r^ + ui + U2 + U3 — 6 = 3hj — 9. Since 
ri + r2 + r^ > hij and Mi + M2 + Ms > 3uj, the claim holds. 

Statement H. 

1 12 
mu-c= 51 «i < Yl - oS- - 1) < II {nj - -{hj - 1) - 1) < -n. 

l<i<c l<j<c l<J<c 
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Statement 0. Let rii be the number of nodes of degree i in T. Since T is a tree of n + 1 
nodes, we have 

2n = ■ rii > n* + n2 -\ \- Uk + {k + 1) ■ Nk] 

i>l 

n + 1 = n*+n2-\ + rik + Nj^. 

-^fc < f follows immediately. □ 
Lemma 3.11 

1. Ga has a weakly convenient encoding of2rha + 2m„ + 3n + o{rha + n) hits, from which 
the degree of a node in Ga — T is obtainable in 0(1) time. 

2. Ga has a convenient encoding of 2rha + + (4 + i)n + o{rha + n) bits, for any positive 
constant k. 



Proof. 

Statement |I|. Since each Tj is a spanning tree of Mj that satisfies Lemma |2]^, T is also 



a spanning tree of Gs that satisfies Lemma p.4| . Then, by Lemma |3.5| , Ga has a weakly 
convenient encoding of 2ma + 2n + o{ma + n) bits, from which the degree of a node in — T 
is obtainable in 0(1) time. We next extend this encoding to a desired weakly convenient 
encoding X for Ga- Since Ga — T = Ga — T, it suffices to add an n-bit binary string R such 
that R[i] = 1 if and only if Vi is real. Since = Tha + the statement follows. 

Statement ^ To augment the above encoding X into a convenient one for Ga, it suffices 
to support in 0(1) time a query on the number of real children of Vi in T. Fix an integer 
k. Let D be a binary string that contains A^^ copies of 1. If Vi is the /ij-th node in f i, . . . , f„ 
of degree more than k in T, we put copies of right after the hi-th 1 in D. The length 
of D is at most + n — mu- Since k = 0(1), by the definition of a weakly convenient 
encoding, it takes 0(1) time to determine whether di > k from X. If di < k, di and thus 
the number of real neighbors of Vi in T can be computed in 0(1) time from X. If di > k, 
the number of real neighbors of Vi in T is select(D, hi + 1,1) — select(D, hi, 1) — 1 + R[i]. 
To compute hi in 0(1) time, let Y be an n-bit binary string such that Y[i] = 1 if and 
only if di > k. Clearly if di > k, then hi = rank(y, i, 1), computable in 0(1) time from 
Y + a{Y). Since each Y[i] can be determined in 0(1) time from X, Y need not be stored 
in our encoding. In summary, X + D + a{D,Y) is a convenient encoding for Ga, which can 
be coded in 2ma + rnu + An + Nk + o(ma + n) bits. The statement follows immediately from 
Lemma p:TU|(|3|). □ 

The next theorem summarizes the above discussion and extends Lemma p.ll| to accom- 
modate multiple edges in T. 

Theorem 3.12 Let G be a plane graph of n nodes and m edges. Assume that Gg is the 
simple version of G. 

1. G {respectively, Gs) has a weakly convenient encoding of bit count 2m + + o[m + n) 
{respectively, ^rhs + 5n + o{n)). 
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2. G {respectively, Gg) has a convenient encoding of2rh+{5 + ^)n+o{rh+n) {respectively. 



+ (5 + x)^ + o{n)) bits, for any positive constant k. 



Proof. The statements for Gs follow immediately from Lemmas p.lO| (]I|) and |3.11| with 
Ga = Gs- To prove the statements for G, we first choose Ea to be the set of multiple edges 
such that {Gs — T) U Ea is composed of the multiple edges of G between adjacent nodes in 
Gs — T. Also, let Eb = G — Ga] let h be the number of edges in Eb. 

Statement Continuing the proof of Lemma |3.11| (P^, we augment the weakly convenient 



encoding X for Ga into one for G. We support in 0(1) time a query for the number of 
multiple edges of G between Vi and its parent in T as follows. 

Initially, Lq is n — c copies of 1, one for each node that is not in the first two levels of T; 
recall that all nodes in the first two levels of T are unreal. For 1 < i < n, suppose that Vi 
is the grth node in vi, . . . ,Vn that is not in the first two levels of T. We put copies of 
right after the grth 1 in Lg. Since G has n + ib — edges between adjacent nodes in T, Lq 
has 2n — c + ib — f^u bits. 

Let L be an n-bit binary string such that for 1 < i < n, L[i] = 1 ii and only if Vi is 
not in the first two levels of T. Clearly if L[i] = 1, then gi = rank(L, z,l). Since L[i] is 
obtainable from X in 0(1) time, a, is obtainable in 0(1) time from X + Lq + a{Lo,L). 
Moreover, since R[i] = 1 if and only Oj > 1, i? can be removed from X. Thus, G has a 
weakly convenient encoding X of 2rha + m.^ + An — c + ib + o{rha + n) bits. The statement 
follows from Lemma p.l0| (3) and the fact that m = rha + lb- 



Statement 0. We now augment the above encoding X into a convenient one for G. It 
suffices to support in 0(1) time a query on the number of the real multiple edges G 
between Vi and its children in T. Initially, D is copies of 1. Suppose that Vi is the /ij-th 
node in Wi, . . . , f„ of degree more than k in T. We put rj copies of right after the /ij-th 
1 in D. As in the proof of Lemma |3.11| (^, hi is obtainable from Y + a{Y) in 0(1) time. 



where Y is not stored in the encoding. If di > k, Vi is computable as select(D, hi + 1,1) — 
select(D, hi, 1) — 1 in 0(1) time. If di < k, ri is computable in 0{k) time from X. D has 
at most Nk + n + ib — niu bits. Hence G has a convenient encoding X + D + a{D,Y) of 
2ma + ^n + 2lb + Nk — c+ o{ma+ ib + n) bits. Then, this statement follows from Lemma |3.1CI| (^ 
and the fact m = rha + ib- D 

3.5 Graphs with Self-loops 



Remark. The encodings of Theorems |3.^ , p.9| , and |3.12| assume that G has no self-loops. 
To facilitate the coding of self-loops, we assume that the self-loops incident to a node in 
a plane graph are recorded at that node by their number. Then, to augment each cited 
encoding to accommodate self-loops, we only need to add 1 to the coefficient of the term 
n in the bit count as follows. Initially, Z is n copies of 1. Then, for 1 < i < n, we put Zi 
copies of right after the i-th 1 in Z, where Zi is the number of self-loops incident to f We 
augment the encoding in question with Z by means of Fact |1]. Since the bit count of Z is 
n plus the number of self-loops, our claims follows from the fact that the coefficient of the 
term m in the bit count in question is at least one. 
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4 More Compact Schemes 



For applications that require no query support, we obtain more compact encodings for tri- 
connected plane graphs in this section. All graphs in this section are simple. 

Let G be a triconnected plane graph with n > 3 nodes. Let T be a canonical spanning 
tree of G. Let fi, . . . , f„ be the counterclockwise preordering of T, which by Fact ^ is also a 
canonical ordering of G. 

Let Ii, . . . ,Ik be the interval partition for the ordering f i, . . . , f„. Recall that the con- 
struction algorithm of builds G from a single edge (^1,^2) through a sequence of K 
steps. The j-th step corresponds to the interval Ij = [k,k + q]. There are two cases, which 
are used throughout this section. 

Case 1: q = 0, and a single node Vk is added. 

Case 2: g > 0, and a chain of g + 1 nodes Vk, ■ ■ ■ , v^+q is added. 

The last node added during a step is called type a; the other nodes are type b. Thus for 
a Case 1 step, Vk is type a. For a Case 2 step, Vk, f fc+i, • • • , Vk+q-i are type b, and the node 
Vk+q is type a. To define further terms, let ci = vi,C2, ■ ■ ■ ,Ct = V2 be the nodes of Hk-i 
ordered consecutively along Hk-i from left to right above the edge (^1,^2). 

Case 1. Let q and c^, where 1 < £ < r < t, be the leftmost and rightmost neighbors of Vk 
in Hk-i, respectively. The edge {cr,Vk) is called external. The edges {ci,Vk) where i < i < r, 
if present, are internal. Note that (q, Vk) is in T. 

Case 2. Let q and c^, where 1 < i < r < t, he the neighbors of Vk and v^+q in H^-i, re- 
spectively. The edge {cr,Vk) is called external. Observe that the edges {ce,Vk), (f/c,ffc+i), ■ ■ ■, 
{vk+q-i,Vk+q) are in T. 

For each Vh, where 1 < /i < ri — 1, let Biyu) denote the edge set {{yh,Vj) \ h < j}. By 
the definition of a canonical ordering and Lemma |27^ , the edges in B{vh) form the following 
pattern around Vh in the counterclockwise order: a block (maybe empty) of tree edges; 
followed by at most one internal edge; followed by a block (maybe empty) of external edges. 
Note that -B(f 1), B{v2), . . . , B{vn-i) form a partition of the edges of G. Also, B{vh) is not 
empty since by the definition of a canonical ordering, every Vh is adjacent to some Vj with 
h<j. 

Lemma 4.1 Given B{vh) for 1 < h < n — 1 and the type of for 3 < h < n, we can 
uniquely reconstruct G. 

Proof. We first draw (fi,t'2) and then perform the following K steps. Step j processes 
Ij = [k,k + q]. Before this step, Gk-i and Hk-i have been built. Let ci = fi, C2, . . . , Q = f2 
be the nodes on Hk-i from left to right. We know the numbers of remaining tree and external 
edges at each q, i.e., those in B{ci) not yet added to G. We next find the leftmost neighbor 
Ci and the rightmost neighbor of the nodes added during this step. Note that (q, v^) is in 
T. Since Vi, . . . ,Vn is the counterclockwise preordering of T, q is the rightmost node with 
a remaining tree edge; Cr is the leftmost node to the right of q with a remaining external 
edge. There are two cases: 

If Vk is type a, then this is a Case 1 step and Vk is the single node added during this step. 
We add (q,^^) and {cr,Vk). For each Cj with i < i < r, ii B{ci) contains an internal edge, 
we also add (q, Vk). 
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If Vk is type b, then this is a Case 2 step. Let q be the integer such that Vk,Vk+i, ■ ■ ■ , 
Vk+g-i are type b and Vk+g is type a. The chain Vk, ■ ■ ■ , v^+q is added between q and c^. 

Finally, the number of remaining tree (respectively, external) edges at q (respectively, 
Cr) decreases by 1. The numbers of tree, internal, and external edges remaining at each Vi 
for k < i < k + q are set to those of all tree, internal, and external edges in B[vi). This 
finishes the j-th step. When the K-th step ends, we have G. □ 

By Lemma ^?T| , we can encode G by encoding the types of all Vh and B{vh) for 1 < /i < 
n — 1 using two strings Si and 5*2. 5*1 is a binary string containing one bit for each Vh-, 
indicating the type of Vh- S2 encodes the sets B{vh) using three symbols 0, 1, *. The code for 
B{vh) is a block of O's, followed by a block of I's, followed by a block of *'s. The number of 
O's (respectively, I's and *'s) in the first (respectively, second and third) block is that of the 
tree (respectively, external and internal) edges in B{vh)- However, since these three numbers 
can be zero, we need a fourth symbol to separate the codes for B{vh)- Now if we use two 
bits to encode each of the 4 symbols used in 5*2, then S2 has a longer binary encoding than 
desired. We next present a shorter encoding by eliminating the symbol used to separate the 
codes for B{vh)- 

The type of B{vh) is defined to be a combination of symbols T, X and /, which denote the 
existences of tree, external or internal edges in B{vh), respectively. For example, if B{vh) is 
type TI, then it has at least one tree edge, exactly one internal edge, and no external edge; 
recall that each B{vh) has at most one internal edge. Moreover, for all Vh of type a, if B{vh) 
has no tree edge, then we call Vh type al; otherwise, Vh is type a2. For Vh of type b, since Vh 
is added in a Case 2 step and is not the last node added, B{vh) has at least one tree edge 
and thus no similar typing is needed. 

Our encoding of G uses two strings and 5*2. 5*1 has length n. For 1 < h < n, Si[h] 
indicates whether Vh is type al, a2, or b, which is recorded by symbols 0, 1, or *, respectively. 
For convenience, let Vi be type a2 and V2 be type al. 5*2 uses the same three symbols to 
encode B{vh) for 1 < h < n — 1. B{vh) is specified by a codeword code[t>/j] defined in Figure 
^. 5*2 is the concatenation of the codewords code[f/i]. 

Lemma 4.2 For 1 < h < n — 1, the sets B{vh) and the types of all Vh can be uniquely 
determined from Si and 5*2. 

Proof. We can look up the type of Vh in Si. To recover B{vh), we perform the following 
n — 1 steps. Before step h, we know the start index of code[f/t] in ^2. With the cases below, 
step h finds the numbers of tree, external, and internal edges in B{vh) as well as the length 
of code[f/t], which tells us the start index of code[f/i+i] in S'2. 
Case A: Vh is type al. There are three subcases. 

Case Al: code[vh\ starts with 0. Then B{vh) is type / and contains only one internal 
edge. Also, code[f/i] has length 1. 

Case A2: code[vh] starts with *. Then B{vh) is type X with (3=1 external edge. Also, 
code[uft] has length 1. 

Case A3: code[vh] starts with 1. Let B = P be the maximal block of I's in S'2 at the 
start of code [f ft]. Then, code[fft] has length 7 + 1. Let x be the symbol after in S'2. There 
are two further subcases. 

If X = *, B{yh) is type X and has /5 = 7 + 1 external edges. 
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Figure 5: This code book gives codefi'h]. The length of code[i'/i] is the number of edges 

in B{vh)- The numbers of the tree and external edges in B{vh) are denoted by a and f3, 
respectively. Recall that B{vh) contain either or 1 internal edge. The notation denotes 
a string of t copies of symbol z. A symbol T, X, or / under code[w/i] denotes the portion in 
codef^;^] corresponding to the tree, external, or internal edges, respectively. 



If a; = 0, B{vh) is type XI and has /9 = 7 external edges and one internal edge. 
Case B: Vh is type a2 or b. Then B{vh) contains at least one tree edge. There are three 
subcases. 

Case Bl: co^e[vh\ starts with *. Then B{vh) is type T and contains a — 1 tree edge. 
Also, code[i'/i] has length 1. 

Case B2: code[vh] starts with 0. Let = 0''' be the maximal block of O's in 5*2 at the 
start of code[f/j]. Then code[v/i] has length 7 + 1. Let x be the symbol after O in 5*2. There 
are two further subcases: 

li X — *, then B{vh) is type T and has a = 7 + 1 tree edges. 

If X = 1, then B(vh) is type TX and has 1 tree edge and (3 — ^ external edges. 

Case B3: code[vh] starts with 1. Let 9 = 1''' be the maximal block of I's in 5*2 at the 
start of code[f/j]. There are three further subcases: 

If * follows in 5*2, then B{vh) is type TI and has a = 7 tree edges and one internal 
edge. Also, codefi^^] has length 7 + 1. 

If 0"^* follows in 5*2, then B(vh) is type TXI and has a = 7 tree edges, P — S external 
edges, and one internal edge. Also, code[v/i] has length j + 5 + 1. 

If 0^1 follows in ^2, then B{vh) is type TX and has o; = 7 + 1 tree edges and (3 — 5 
external edges. Also, code[t'/i] has length 7 + 5 + 1. 

This completes the description of the /i-th step. In any case above, we can determine the 
length of codef^;^] and recover B{vh). □ 

The next theorem summarizes the above discussion. 
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Theorem 4.3 Let G be a simple triconnected plane graph with n > 3 nodes, m edges, and 
f faces. 

1. G can he encoded using at most log3-(n + m) + 1 hits. 

2. G can he encoded using at most log 3-(min{n, /} + m) + 2 < ^^^m + 4 hits. 

Remark. The decoding procedure assumes that the encoding of G is given together with 
n or / as appropriate, which can be appended to S by means of Fact |l|. 
Proof. 

Statement 1. In the above discussion, Si has length n, and 5*2 has length m. The encoding 
of G is the concatenation of Si and 5*2 . Treated as an integer of base 3, S uses at most 
log3-(n + m) + 1 bits. 

Statement 2. Let G* be the dual of G. G* has / nodes, m edges and n faces. Since 
G is triconnected, G* is also triconnected. Furthermore, since n > 3, / > 3 and G* has 
no self-loop or multiple edge. Thus, we can use Statement 1 to encode G* with at most 
log3-(/ + m) + 1 bits. Since G can be uniquely determined from G*, to encode G, it suffices 
to encode G*. To shorten S", if n < /, we encode G using at most log3-(n + m) + 1 bits; 
otherwise, we encode G* using at most log3-(/ + m) + 1 bits. This new encoding uses at 
most log3-(min{n, /} + m) + 1 bits. Since min{n, /} < = 0.5m + 1, the bit count is at 
most log 3- (1.5m) + 3. For the sake of decoding, we use one extra bit to denote whether we 
encode G or G*. □ 
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